Topic Tracking Based on Keywords Dependency Profile
نویسندگان
چکیده
Topic tracking is an important task of Topic Detection and Tracking (TDT). Its purpose is to detect stories, from a stream of news, related to known topics. Each topic is “known” by its association with several sample stories that discuss it. In this paper, we propose a new method to build the keywords dependency profile (KDP) of each story and track topic basing on similarity between the profiles of topic and story. In this method, keywords of a story are selected by document summarization technology. The KDP is built by keywords co-occurrence frequency in the same sentences of the story. We demonstrate this profile can describe the core events in a story accurately. Experiments on the mandarin resource of TDT4 and TDT5 show topic tracking system basing on KDP improves the performance by 13.25% on training dataset and 7.49% on testing dataset comparing to baseline.
منابع مشابه
A generalization of Profile Hidden Markov Model (PHMM) using one-by-one dependency between sequences
The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA) which is the representative of a PHMM can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on th...
متن کاملA Query Language of Data Provenance Based on Dependency View for Process Analysis
For the scale of data in process keep increasing, data provenance also becomes large and constantly growing, which brings challenges to the efficiency of provenance tracking in process analysis. This paper proposes a kind of dependency view to extract a global data provenance description of the data process instance, and then defines a contextual query language based on dependency view to imple...
متن کاملLexical Chains versus Keywords for Topic Tracking
This paper describes research into the use of lexical chains to build effective Topic Tracking systems and compares the performance with a simple keyword-based approach. Lexical chaining is a method of grouping lexically related terms into so called lexical chains, using simple natural language processing techniques. Topic tracking involves tracking a given news event in a stream of news storie...
متن کاملTarget tracking in the recommender space: Toward a new recommender system based on Kalman filtering
We assume that users and their consumptions of television programs are vectors in the multidimensional space of the categories of the resources. Knowing this space, we propose an algorithm based on a Kalman filter to track the user's profile and to foresee the best prediction of their future position in the recommendation space. The approach is tested on data coming from TV consumptions. Keywor...
متن کاملAn Adaptive Local Dependency Language Model: Relaxing the Naı̈ve Bayes’ Assumption
We describe a new probabilistic approach in the language modeling framework that captures adaptively the local term dependencies in documents. The new model works by boosting scores of documents that contain topic-specific local dependencies and exhibits the behavior of the unigram model in the absence of such dependencies. Contributions of the current work include adapting van Rijsbergen’s (va...
متن کامل